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Abstract 

Two parties wish to carry out certain distributed computational tasks, and they are given 
access to a source of correlated random hits. It allows the parties to act in a correlated manner, 
which can be quite useful. But what happens if the shared randomness is not perfect? 

In this work, we initiate the study of the power of different sources of shared randomness in 
communication complexity. This is done in the setting of simultaneous message passing (SMP) 
model of communication complexity, which is one of the most suitable models for studying the 
resource of shared randomness. Toward characterising the power of various sources of shared 
randomness, we introduce a measure for the “quality” of a source - we call it eollision eomplexity. 
Our results show that the collision complexity tightly characterises the power of a (shared) 
randomness resource in the SMP model. 

Of independent interest is our demonstration that even the “weakest” sources of shared 
randomness can in some cases increase the power of SMP substantially: the equality function 
can be solved very efficiently with virtually any nontrivial shared randomness. 


1 Introduction 

One of the central themes of complexity theory is the study various resources of computation and 
their respective power. In this direction many different models of computation have been defined, 
and various resources have been considered - e.g., time, space, randomness, access to powerful 
provers, etc. Among the strongest models where the researchers have been able to prove strong 
lower bounds is the model of communication complexity. 

An important resource in communication complexity and, more generally, in virtually all distributed 
computational settings, is the access to a source of correlated random bits or shared randomness 
(SR): It allows the parties to act in a correlated way in order to perform better and more efficiently 
in their tasks. 

The primary goal of this work is a quantitative characterisation of the utility of different sources 
of shared randomness to parties interested in solving communication problems. To this end, let us 
dehne sources of shared randomness more formally. 
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Definition 1.1 (Source of SR for two parties). A source of shared randomness (also referred to 
as a bipartite distribution) is specified by a distribution p on a domain U x V, where U and V 
correspond to the parts of shared randomness visible to Alice and to Bob, respectively. To solve 
their communication task, the parties are allowed to take as many independent samples as they 
need from the source p. 

To understand the motivation behind the above definition better, it would be useful to consider a 
few examples of sources of shared randomness. Let C/ = R = {0,1}. The following two sources on 
U xV are the well-known settings of private and perfect forms of shared randomness, respectively: 

Ppriv ^{00,01,10,11} 5 Pperf '-—^{00,11}) 


where is defined by: 

Notation 1.2. For a finite set Q, we denote the uniform distribution over it by Uq . 

The above two examples {pperf and Ppriv) are valuable as they indicate that the more familiar 
settings of communication complexity are given as special cases of our model by setting p = ppriv 
or p = Pperf - However, more interesting questions in this work are related to the intermediate cases 
between the “extremes” of Pper/ and Ppriv- 

1.1. Shared randomness in SMP communication: definitions and more examples 

Communication complexity studies how much communication is needed to compute a given function 
on distributed inputs. The model we work with is the two-party simultaneous message passing 
(SMP) model, in which the players, Alice and Bob, each send a message to the referee [19, 2]. The 
SMP model augmented with a shared randomness source p is defined as follows: Let p a source 
of shared randomness on ?7 x R and {ui,vi), {u 2 ,V 2 ), ■ ■ ■, {um,Vm) be independent samples from 
p. In the SMP model with shared distribution p, Alice receives input x and her part of shared 
randomness Ra = {ui,... ,U() £ U^, and Bob receives y and his part of shared randomness Rh = 
{vi,... ,vi) £ V^. Alice and Bob use their parts of input and shared randomness to compute their 
messages, Ma{x,Ra) and Mi,{y,Rf), to the referee who upon receiving these messages outputs an 
answer. A communication protocol determines the value of m and the actions of all the participants 
in the protocol. 

A communication protocol is said to solve a communication problem with error probability 5 if it 
guarantees correct answer with probability at least 1—5 for every allowed input. The communication 
cost of a protocol is the maximum possible total number of bits sent by the players to the referee. 
Given a communication problem /, we denote the minimum cost of a protocol that solves / (with 
error probability < |) in the SMP model with SR source p by SMPp{f). 

Our goal. In this work we consider the case when p is not a perfect source of shared randomness 
and investigate to what extent Alice and Bob can still use p to reduce the communication cost 
(as compared to the case of private randomness). Moreover, we are interested in understanding 
what properties of the source p determine its utility as a resource in the SMP model. Consider the 
following slightly more complicated distributions. 

Example 1.3. Let U = V = {0,1}. 

• “Shared disjointness”: The shared distribution is pdisj := ^{ 00 , 01 , 10 }• 
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• Symmetric Noisy Bits: The shared distribution is pBSCp = (1 “ ‘^p)Pperf + ‘^PPpriv, he. 
PBSCp(0,0) = PBSCp (1,1) = ^ and PBSCp (0,1) =pBSCp (1,0) = 

The subtlety of the problem of determining the power of a general source of shared randomness p 
becomes more clear when one notices that even in the simple case of pdisj vs. pBSCp h is a priori 
unclear which source is stronger than the other (say, for p = 0.8). Even more, it is not obvious that 
it should possible to make a useful comparison between two different sources of shared randomness 
Pi and p 2 which, like the above case of pdisj vs. pBSCp, aiay have quite different forms.^ Maybe 
surprisingly, it turns out that comparison between sources of shared randomness of very different 
form is possible (see Theorem 1.8). To achieve this, we introduce the notion of collision complexity 
of a source (see Section 5), which associates to a source p a real valued function colp : N —^ M^, 
whose rate of growth captures the utility of p as a source of shared randomness. 

Before we continue, let us exclude the following case: 

Definition 1.4. A source of shared randomness is degenerate if it is deterministic from the point 
of view of at least one of the players. 

In other words, a source of shared randomness is non-degenerate if the players can use it to loeally 
emulate private randomness (ppriv)- Degenerate sources are not quite that interesting for the 
purposes of this work as they are effectively no more powerful than the deterministic ones, and 
therefore throughout the paper all sources of shared randomness are assumed to be non-degenerate. 

1.2. Our results 

The first result is an example of an SMP protocol showing that even the weakest sources of shared 
randomness can be sometimes quite powerful. More specifically, we have the following theorem for 
the equality function EQ^: {0,1}"' x {0,1}"' {0,1}, defined to be 1 if x = y, and 0 otherwise. 

Theorem 1.5. Let p be a probability distribution on U x V. If p is not a product distribution, 
then it holds that SMPp{EQ^) = 0(1), where the implied constant in O(-) depends only on p (and 
not on n). 

The above result may look somewhat surprising, as we have SMPpriv{EQ^) = Q{^/n) [2, 19], which 
means that even the presence of slightest correlation in the shared distribution p increases the 
power of SMP significantly in the asymptotic sense, decreasing the communication requirements 
from D(y^) to 0(1). 

The above theorem gives rise the following question: 

Question 1.6. Is it possible that what happened for EQ^^ is generic? More precisely, is the 
effect of having an imperfect source of shared randomness p instead of Pperf limited to at most a 
multiplicative constant blow-up in the communication, i.e. 

SMPp(f) = Op(SMPp,pj(f)) 

for any fixed non-product p? 

Note that since the constant in O(-) in the above may depend on p (but not on the instance size 
n), the above question is not as naive as it may at first appear. For instance, if one trusts the 

^Compare this to the case pssCp vs. pssCg where evidently when \p\ < |g| the first source is more powerful than 
the second as the players can always use private randomness to locally simulate pssCp using pBSCg- 
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(incorrect) intuition that “usefulness” of a source of shared randomness {A,B) ~ p is determined 
by the mutual information between the two sides, I{A; B), then it may appear reasonable to believe 
in the affirmative answer to Question 1.6. 

Nevertheless, the answer to Question 1.6 is negative, and the actual trade-off between SMPp{f) and 
SMPpgrfif) is more subtle, and, as shown in Theorem 1.8, is captured by the collision complexity of 
p. Hence, besides the technical significance of Theorem 1.8, we believe that the fact that collision 
complexity (as opposed to entropic quantities like mutual information, etc.) is the appropriate 
measure for quantifying the power of a shared random source is perhaps a noteworthy conceptual 
message of the result. 

Before stating our main theorem, let us give an (informal) definition of the collision complexity and 
observe some of its basic properties. 

Definition (Collision complexity, informal). For n G N, we say that a two-player protocol produces 
a collision if each player output a subsets of [re], respectively denoted by A and B, such that 
Pr[z G An B] > 1/re for each i G [re].^ The complexity of a protocol is the maximum possible 
. The collision complexity of a source p, denoted by colp(re), is the minimum complexity of 
a protocol that uses (arbitrary number of) samples from p and produces a collision. 

Conceptually, collision complexity quantifies the cost of simulating perfect shared randomness by 
two parties who only have p, and therefore it can be viewed as a measure of quality of p. For 
example, it is clear from the above definition that the players sharing Pperf can always use their 
perfect source of shared randomness to agree on an element i G [re] and both output A = B = {i} 
which shows that Pperf has collision complexity 1, which is optimal. The following basic properties 
of collision complexity also follow easily from the definition. 

Fact 1.7 (Properties of collision complexity). The collision complexity colp : N —>■ M"*" satisfies the 
following: 

i. Basic General Bounds: 1 < colp(re) < ^/n. 

ii. Private and Public Randomness: colp^^^^ = 1 and colpp^.^(re) = ^/n. 
hi. Tensor Product Stability: colp®t(re) = colp(re). 

iv. Decrease of Collision Complexity after Tensoring: colp 0 o-(re) < min{colp(re), colo-(re)}. 

Combining the property (i) and (ii) from the above, we see that the minimal value of collision 
complexity is achieved by Pperf and its maximum value is achieved by Ppriv The main intuition 
here is that the higher the “quality” of the correlation provided by p is, the slower we expect the 
growth of colp(re) to be. 

We can now state our main theorem which uses collision complexity to compare SMPp with SMPp^rf 
(where SMPpgrf is a shorthand for SMPp^^^j). 

^An alternate point of view on this criteria is that the parties want to ensure E[|A n B|] > 1, and furthermore, 
they want the intersecting sets A n B to be distributed more or less evenly across [n]. Of course, without the latter 
condition achieving E[|A n B|] > 1 would have been easy, since the players could otherwise have always outputted 
a fixed i € [nj; however, the evenness condition along with the desire to minimise |A| -|- |B| makes this a non-trivial 
task. 
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Theorem 1.8. For any communication problem f and non-degenerate source of shared randomness 
p, we have 

SMPpif) = d{SMPp,rfif) • colp(n)). (1) 

Moreover, there exists a family of partial function {pn} such that 

SMPp{gn) = 0(colp(n)). (2) 


The first part of Theorem 1.8 states that any source of correlation can “replace” the perfect source 
Pperf = ^{00,11} at the multiplicative cost of colp(n). The second part of the theorem guarantees 
the existence of a family of partial functions with SMPp{gn) exhibiting the behaviour expected 
from the first part of the theorem - which proves tightness of the result, up to poly-logarithmic 
factors. 


The particular family of partial functions in the second part of Theorem 1.8 is defined as follows. 

Definition 1.9 (Gap-inner-product). Let n,m € N and Vi € [n] : Xi,yi € F™. Define the gap- 
inner-product function GAPIPn,m as 


GAPIP1 • • • 1 ^n)i iUl ) ■ ■ ■ ) J/n)) 


0, |{i G [n ]: Xi-yi = 0}\ > 2n/3, 

< 1, |{i G [n]: Xi • ?/i = 1}| > 2n/3, 

_L, otherwise. 

v ■ 


We set pn = GAPIPn where GAPIPn is shorthand for GAPIP^^siogn, assuming that n is a power 
of two in this context. 


Note that the second part of Theorem 1.8 itself consists of both an upper bound and a lower bounds 
on SMPp{GAPIPn) (the main technical challenge being proving the lower bound). 

Our last set of result (presented in the appendix) describes the relation between the rate of growth 
of colp(n) and the hypercontractivity properties of p, using that to give bounds on the collision 
complexity of pdisj ■ 


2 Background and related work 


Simultaneous message passing model. As we alluded to before, the SMP model is partic¬ 
ularly suitable for studying shared randomness. This is due to the fact that the power of this 
model depends crucially on availability of a common random source. It is easy to see that shared 
randomness allows the players to use mixed strategies. 

The classical example of the equality function on n-bit strings demonstrates a problem that requires 
Q{y/n) bits of communication in the SMP model with private randomness only, but can be solved 
by a 0(l)-bit protocol in the SMP model with (perfect) shared randomness. Moreover, we will see 
in Section 4 that any non-trivial form of shared randomness is sufficient for solving the equality 
function by a 0(l)-bit protocol. On the other hand, in virtually all other existing two-party models, 
at most additive logarithmic “saving” in terms of communication complexity can result from the 
availability of shared randomness (as opposed to private randomness only). 

In this work we generally view poly-logarithmic multiplicative factors as insignificant in the context 
of communication complexity, and therefore for our needs SMP model is the most suitable model 
to consider. ^ 

® On the other hand, in a parallel work, Canonne et al. [4] study the effects of shared randomness in the one- and 
two-way settings in the “sub-logarithmic regime” and obtain rather interesting and unexpected results. 
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Collision complexity and measnres of qnality of correlation. Intuitively and informally, 
our notion of collision complexity is a measure of quality of shared randomness pas it satisfies the 
tensor product stability property (Fact 1.7). A number of other “measures of quality” of correla¬ 
tions have been extensively studied. Perhaps the most well-known among those is the maximum 
correlation and hypercontractivity. The latter which is of particular interest due to both its utility 
in a multitude of applications and its striking mathematical elegance. 

The literature on measures of quality of shared randomness (and in particular hypercontractivity) 
is vast and we will not try to give a comprehensive survey here; instead, we refer the reader to 
recent works [1, 7, 13] and references therein. We just briefly note one particular interesting line 
of work on a related problem called non-interactive correlation distillation (NICD) [15, 17, 22]. 
In the two-party NICD, Alice and Bob have access to an unlimited number of independent copies 
of p (just as in our case), and their task is to produce a marginally uniform bit each so that the 
outputs by Alice and Bob agree with the maximum probability. Although both NICD and the 
collision complexity are closely related to the maximum correlation and hypercontractivity, their 
exact relationship is unclear. On the other hand, the relationship between hypercontractivity and 
maximum correlation is evident in many of the works cited above. We discuss the relationship 
between these two notions and collision complexity in Subsection 7.4 and the appendix. 

The direct precursor to this work is a work of Gavinsky, Ito and Wang [8], which, to our knowledge, 
was the hrst that studied different “forms of shared randomness” in communication complexity. 

3 Preliminaries 

Throughout the paper, the base of logarithm is two unless stated otherwise. For G {0,1}"', by 
X ■ y we mean the inner product of x, y as elements of F 2 , i.e. XiPi mod 2. 

Recall from the introduction that by a source of shared randomness p (also sometimes referred to as 
a bipartite distributions) we mean a distribution over a set T, = U x V and for ACS the uniform 
distribution over a set X is denoted by Ux- Also for any distribution p on U x V we denote the 
marginals of p on U and V by pu and py, respectively. 

The main operation on sources of shared randomness is the tensor product. 

Definition 3.1. Let pi and p 2 be two distributions over Ui x Vi and U 2 x V 2 respectively. We 
define a new source of shared randomness pi 0 p 2 over {Ui x U 2 ) x (Vi x V 2 ) by setting 

{pi ® P 2 ) {{x, x'), {y, y')) = pi{x, y) ■ p 2 {x', y') 

for any (x,y) G C/i x Fi and {x',y') £ U 2 x V 2 . 

Maximum Correlation. Let phe a distribution onU xV. The spaces U and V equipped with 
the measures pu and py form two probability spaces which turns the vector space of real-valued 
functions over U and V into a Hilbert space via /(^^)^ and \\g\W = E^....,p^ 9{vY■ 

Given the above, we can dehne the maximum correlation of p as follows. 

Definition 3.2 (Maximum correlation). Given a distribution p onU xV the maximum correlation 
of p is defined 

Cor(p) = sup E [f{u)g{v)] 

f,g {u,v)r^p 

where the supremum is taken over functions satisfying f{u) = E^....,p,^ f{v) =0 and jj/jj 2 = 

Il5l|2 = l. 
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From the above definition it is clear that Cor(/9per/) = Ij which is clearly the largest the maximum 
correlation can be, because one can take f = g for any ||/||2 = 1- On the other hand for any 
product distribution, such as Ppriv, the maximum correlation is zero because in that case by the 
independence we have f{u)f{v) = f{u) E^ f{y) = 0. 

The following follows easily from the definition. 

Lemma 3.3. Let p be a distribution on U x V and /: 17 — )• [0,1] and 5 : ^ ^ [0,1]. Then, 

E f{u)g{v) < E f{u) E f{v) + Cor(p) • \/Var(/) Var(fi(). 

(u,v)^p u 


Proof. Let a = 'Eur^.^p^ f{u) and b = E.^'^py Note that f{u)g{v) can be written as 

a6 + {f{u) — a) ■ {g{v) — b). The lemma follows from noting that (/ — a)/y^Var(/) and {g — 

b)/■\/Yar{g) have mean 0 and variance 1 and the definition of the maximum correlation. □ 

The main attractive feature of maximum correlation is its simple behaviour under tensor product 
of different distributions.^ 

Lemma 3.4 (Witsenhausen [21]). For i G [n], let pi be a probability distribution on Ui x Vi. Then 
it holds that Cor(/9i (g) • • • (g) pn) = maxjgj^j Cor(pj). 

Proof. It is easy to see from the definition that maximum correlation of a distribution p is just the 
second largest singular value of the matrix Ay v = =. The lemma then follows by noting 

’ VPi^ipiv) 

that the singular values of the tensor product of two matrices are given by the pairwise products 
of the singular values of the original matrices. □ 

3.1. Information theory 

Here we give a brief account of some of the concepts from Information Theory. For a more detailed 
introduction containing the omitted proofs, one can consult [6]. 

For a random variables X over a domain X, its entropy is defined as 

The entropy of a random variable X conditioned on Y is the average (according to Y) of entropies 
of random variables X\Y = y. More concisely, we have H{X\Y) = H{XY) — HifY). 

Given random variables X and Y we define their mutual information via 

I{X; Y) = H{X) + H{Y) - H{XY) = H{X) - H{X\Y) = H{Y) - H{Y\X). 

The main fact that we shall need regarding the mutual information is the following. 

Fact 3.5 (Chain rule). For any random variables X,Y, Z,W we have 

I{X-, YZ\W) = I{X- Y\W) + /(X; Z\Y, W). 

^Collision complexity itself also satisfies a nice property under tensor product (though this is essentially by def¬ 
inition and not due to a non-trivial underlying mathematical phenomenon), as explained in Fact 1.7 (iii) and (iv); 
but these are in general too weak to be of much help. 
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Finally, we will need a simple lemma which we use in our analysis. 


Lemma 3.6. Let X he a random variable uniformly distributed over the domain X and let P he 
an event. Then, 

H(X\P)> H(X}-log 

We note that the above lemma does not hold in the current form when X is not uniform. To see 
this, let Xi,X 2 be uniform independent random variables over Xi,X 2 respectively. Let i G {1,2} 
be also uniform and let X = Xg. Taking P to be the event i = 1 and assuming that iTil <C 1^21 
we see that in this case P[{X\P) will be much less than H{X) — 1 which proves the necessity of 
uniformity assumption. 

Proof of Lemma 3.6. We have 

H(X\P) = gPrlX = x|Pl log = E Pr[X = x|Fl - log 

= log(Pr|F|) + Y, P'i.^ = x\P] ■ log 

> log(Pr|P|) + Pr|X = x|P| . log (pjj^^) 

= log(Pr[P]) + Pr[X = x\P] • log |T| 

x&X 

= log(Pr[P].|T|), 

where the latter is clearly equal to H{X) — log( p^|p | ). □ 

3.2. Communication complexity 

We assume some basic familiarity with communication complexity [14]. For a partial function / 
from {0,1}” X {0,1}"' to E, we denote by SMP[f) the SMP communication complexity of / without 
shared randomness with worst-case error probability (over the private coins of Alice and Bob) at 
most 1/3. 

For n > 1, let IPn'- {0,1}"' x {0,1}” —)• {0,1} be the inner product function modulo 2. Chor and 
Goldreich [5] showed that any communication protocol which answers IPn with error probability 1/3 
on average, even in the two-way communication model, must have communication cost at least n/2— 
o(n). The following is an immediate corollary of this. 

Corollary 3.7. For n > 1, the communication complexity of IPn in the SMP model with public 
randomness is in Q(n). 

We will use the following fact originally proved by Newman [18] and rehned by Kushilevitz and 
Nisan [14, Theorem 3.14]. The statement in [14] assumes that the function takes a Boolean value, 
but the proof does not use this assumption. 

Lemma 3.8. Let 0 < e < e' < 1/2 he constants. Any SMP protocol with public randomness for a 
partial function from {0,1}” x {0,1}"' to E with error probability at most e can be converted to 








one which uses only logn + C hits of public randomness with error probability at most s' without 
changing the communication cost, where C = C{e, e') > 0 is a constant depending only on e and e'. 

4 Usefulness of all non-product sources of shared randomness 

In this section, we prove Theorem 1.5 showing an easy but nontrivial use of imperfect shared 
randomness in an SMP protocol for equality function. 

As a consequence of this theorem, we see that the lower bound claimed in the second part of our 
main theorem (Theorem 1.8) does not hold for an arbitrary family of functions {gn}. Thus a good 
choice of a function family was necessary there. 

Beyond its bearing on Theorem 1.8, the result is interesting on its own as it explicitly shows that 
in some settings any form of public shared randomness (i.e. one coming from a non-product source 
p) can have a signihcant impact on the asymptotic communication cost. 

Proof of Theorem 1.5. Because p is not a product distribution, there exist subsets Aa ^ U and Aj, ^ 
V such that 

7 = Pr [tt e Aq A u e Afe] / Pr [n G A^] Pr [u G A^] = 7 '. (3) 

{u,v)^p 'V'^Pb 

A protocol for solving the equality is as follows. Alice and Bob share 2 ”' copies of shared randomness; 
label the 2” i.i.d. pairs of registers containing the shared randomness as {ux,Vx) ~ p for x G {0,1}”. 
Alice defines a = 1 if Ux G A^ and a = 0 otherwise, and sends a to the referee. Similarly, Bob 
defines /3 = 1 if Uy G A;, and /3 = 0 otherwise, and sends (3 to the referee. The referee checks 
whether a = (5 = 1 or not. If x = y, this happens with probability 7 , and otherwise it happens 
with probability 7 '. The referee can tell which is the case with error probability at most 1/3 by 
repeating this protocol for t = 0{\y — which is a constant independent of n. □ 

5 Collision complexity 

In the introduction we gave an informal definition and briefly mentioned some of the basic proper¬ 
ties of collision complexity. In this section, we look at this notion in more detail and provide a more 
formal dehnition. We also give an alternate and more analytic characterisation of collision com¬ 
plexity in terms of a different measure of quality of shared randomness called agreement complexity. 
Beside providing a different perspective on collision complexity, this alternate characterisation will 
be also useful technically at various places later. 

We start our work toward the formal definition of collision complexity by defining a collision pro¬ 
tocol. 

Definition 5.1 (Collision protocol). Let p be a probability distribution on U x V. A p-collision 
protocol ^ for p with domain size n is determined by two functions, A : ^ V[n] and B : U —)• 

V[n] for some G N, such that 

Vf G [n] : Pr [{u,v) ■ i G A{u) n B{v)] > p. (4) 

®Think of p as a decreasing function of n: say, p — l/n. 
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It is sometimes convenient to think of Alice’s and Bob’s functions A and B as randomised mappings, 
and that is allowed: we always assume that p is non-degenerate, so the players can use it to generate 
private randomness (it doesn’t have to be hidden from the other party, as the players are always 
cooperating). 

The complexity of a collision protocol {A,B} is given by the maximum output size: 

colp{n,p,{A,B})=^ meix{\A{u)\,\B{v)\ : {u,v) £ supp{p)} . (5) 

Definition 5.2 (Collision complexity). Let n G N and p = p{n) G [0,1] be a probability parameter 
possibly depending on n. The p-collision complexity of a source p (over a domain of size n) is the 
minimum worst case output size necessary for any protocol achieving a collision probability of p 
for all i G [n]. In other words. 


colp(n,p) = ^inf^colp(n,p, {A, B}), 
where the infimum is over all protocols {£,A,B) satisfying (4). 

The main parameter setting of interest for us here will be p = 1/n and as such what follows we 
shall let 

colp(n) colp(n, I/n). (6) 

Note that the collision complexity of a non-degenerate distribution can be at most 0{y/n) (achiev¬ 
able using private randomness via the birthday paradox). Similarly, the rest of the properties from 
Fact 1.7 easily follow from the definition. 

Our second measure of correlation is agreement complexity. It is closely related to collision com¬ 
plexity, but sometimes will be more convenient to work with. 

Definition 5.3 (Agreement complexity). Let p be a probability distribution on 17 x F. 

An agreement protocol for p is determined by 7 G N and a pair of functions f: —>• [0,1] 

and g:V^^ [0,1]- The cost of this agreement protocol is E[/(ui,...,u^) -|- g{vi,... ,V£)], and 
the success probability of this protocol is E[/(ui, • • • ,Ui)g{vi,... jU^)], where (uj,Uj) ~ p indepen¬ 
dently for all i £ [£]. 

The agreement complexity of p at success probability p, denoted by agrp(p), is the infimum of the 
cost of an agreement protocol for p with success probability at least p. 

Here Alice and Bob output one bit each (as opposed to a subset of [n] in the case of collision 
complexity). The value f{ui,... ,Ui) is the probability that Alice outputs “1”, given her part of 
shared randomness, and similarly for g{vi,... ,V£). The players’ task is to output “1” simultaneously 
with probability at least p, while minimising the sum of the probabilities that each party outputs 
“1” - the corresponding infimum is the agreement complexity. 

5.1. Equivalence of collision and agreement complexities and fnrther properties 

The first lemma provides some basic parameter trade-offs. 

Lemma 5.4. For any positive integers m, n and p £ (0,1) we have: 

colp(n, 1 — (1 — p)™) < mcolp(n,p), colp(mn,p) < mcolp(n,p). (7) 
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Proof. The first inequality follows by repeating a collision protocol m times independently and 
outputting the union of the results. For the second one, repeat the collision protocol m times 
independently to obtain Ai,..., Am T [n] and Bi,..., Bm C [n], and output A = {n(i — 1) + j : i G 
[m\,j G Aj} and B = {n{i — 1) + j: i G [m],j G Bj}. This gives a collision protocol with domain 
size mn, collision probability p, and output size at most mcolp{n,p). □ 

As we mentioned collision and agreement complexities are intimately related. This is captured by 
the following lemma. 

Lemma 5.5. For a bipartite distribution p, n > 1, and 0 < p < 1, it holds that colp{n,p) = 
0(max{l, n agrp(p)}), where the constant in the asymptotic notation does not depend on p, n, 
or p. 

The proof is based on the following idea: If i G [n], a collision protocol A,B) of domain size n can 
be converted to an agreement protocol where Alice outputs 1 \i i G A and Bob outputs 1 i G B. 
By choosing the “best” value of i, we obtain the claimed upper bound on the agreement complexity. 
For the conversion in the opposite direction, Alice and Bob repeat the agreement protocol n times 
in parallel to decide whether their output should contain each i G[n\. By the Chernoff bound, this 
provides the necessary upper bound on the collision complexity after an application of Lemma 5.4. 
The formal proof is as follows: 


Proof. Let {i,A,B) be a collision protocol for p with domain size n, output size colp(n,p), and 
collision probability at least p. For i G [n], let fi{u) = Pr[i G A(rt)] for u G and let gi{v) = 
Pr[i G B{v)\ for v G V^. Because the collision probability of {i,A,B) is at least p, it holds 
that E[fi (u)gi{v)] > p for every i G [n]. Because The output size being is colp(n), it holds 
that |A('u)| < colp{n,p) and |B(F)| < colp{n,p) for all u G and v G V^, and in particular, 
it implies that 

E[|A(u)| + |B(i;)|] = ^ E[fi{u) + gi{v)] < 2colp(n,p). 

i&[n] 

Therefore, there exists i* G [n] such that 


E[/i*(u) +gi-{v)] < 


2colp(n,p) 

n 


Then the pair (/,*, gi *) is an agreement protocol with success probability at least p and cost at 
most 2colp(n,p)/n. Therefore, we have that nagip{p) < 2colp(n,p). 

Next we prove that colp(n,p) < 9nagrp{p) + 48. Because agrp{p) > 2p, this inequality is trivial 
if p > 1/2. For the rest of the proof, assume p < 1/2, and we will prove the inequality in two steps: 


colp(n,p/2) < SnagTpip) + 16, (8) 

colp(n,p) < 3colp(n,p/2). (9) 


Let K > agrp{p). By the dehnition of the agreement complexity, there exists an agreement pro¬ 
tocol {i,f,g) for p with success probability at least p and cost less than c. Let a = E«r..p^^ [/(ft)] 
and b = Evr.^p^^[g{v)], and we have that a < K and b < K. Let T = 3nK + 16. Consider 
the following collision protocol. Alice interprets her random input as {ui,... ,Un) G and 

Bob interprets his random input as (vi,...,Vn) G so that {ui,Vi) is distributed according 

to p^^ for each i G [n] and n pairs (fti, Fi), ..., {un,Vn) are mutually independent. Alice constructs 
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sets A,A C [n] as follows. For each i G [n], i belongs to A with probability f{ui), independently 
of everything else. If 1^41 <T, then she defines A = A; otherwise A = 0. She outputs set A. Bob 
constructs sets B and B in the analogous way, and outputs B. It is clear that the output size of 
this collision protocol is at most t. 

Let i G [n]. By the Chernoff bound, it holds that 

Pr[A ^A\i€AriB]= Pr[|iL| > T \ i e AdB] 

= Pr[| \ {i}| > t — 1 I i G ^ n B] 


< 


1 


ga(n—1) 

e' 


ea{n — 1) 


T- 1 


T-l 


T-1 1 

^ ^ 4’ 

as 


where the last inequality follows from (e/3)^° < 1/4. Similarly, it holds that Pr[B ^ B \ i £ 
A0 B] < 1/4. By union bound, we have that 

Pr[A = A ^ B = B \ i £ A0 B] > 


Therefore, 


Pr[i £ A<r\B] = Pr[i £ Ar\B] Pr[T = A ^ B = B \ i £ A<r\ B] > 


meaning that the collision probability of this collision protocol is greater than p/2. This implies 
that colp(n,p/2) < 2>nK + 16. Because this holds with any K > a,gip{p), inequality (8) follows. 

Inequality (9) follows from (7) by noting that 1 — (1 — p/2)^ > p because 0 < p < 1/2. □ 

6 Simulation of SMPperf protocols in SMPp 

Given the technical tools we developed in the previous section we are ready to prove the first (easy) 
part of Theorem 1.8. 

Proposition 6.1. Let p he a bipartite distribution. Then for any (possibly partial) function f 
over {0,1}"" X {0,1}” we have 

SMPpif) = 0(colp(n) (SMPperfif) + logn)). 

To prove the lemma, we first need a symmetrisation claim which shows that at a small multiplicative 
cost one can always turn a collision protocol into one which treats all elements i £ [n] the same 
way. More formally, we have the following: 

Claim 6.2. Let 0 < s < 1. For a bipartite distribution p and n > 1, there exists a collision 
protocol {£, A, B) with domain size n and output size at most 0(colp(n)) that satisfies the following 
two properties: 


(a) n B{v) = 0]<s. 

(b) Conditioned on the event A(u) Ci B{v) ^ 0, selecting one element from A(u) H B{v) uniformly 
at random produces the uniform distribution over [n]. 
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The constant in the asymptotic notation depends on s, the failure probability which is a small but 
fixed constant, however it does not depend on p or n. 

Proof. We first prove the case where s > l/e + 1/2. Consider the collision protocol used in the proof 
of inequality ( 8 ) in Lemma 5.5 with p = 1/n. This protocol satisfies condition (b) by symmetry. 
Its output size is at most 3reagrp(l/n) +16, which is in 0(colp(n)) by Lemma 5.5. By the Chernoff 
bound, it holds that 


Pr[A ^ i| = Pr||i| > T] < _ . (f^) < (I) 

and similarly Pr[i? ^ B] < 1/4. By union bound, it holds that 


e\T 1 
<4’ 


Pr[A n B = 0] < Pr[Ji nB = 0]+ Pr[A / + Pr[B / B] 


< 1 - 


n 


1 1 
+ 7 + 7 

4 4 


1 1 


This proves the case where s > 1/e + 1/2. 

To prove the general case, repeat this protocol for [log s/log(l/e + 1/2)] times and output the 
union of the results. □ 


Proof of Proposition 6.1. Using randomness reduction, i.e. Lemma 3.8, we know that there exists 
an SMP protocol for / using at most logn + C bits of public perfect randomness achieving 
with error probability at most 2/5. Fix this protocol and set m = logn + C and e = 2/5 and 
s = (1 — 2e)/(4 — 4e). Let a(x,r) to be the message which Alice sends given input x and public 
(perfect) random string r £ and let P{y,r) be the message which Bob sends given input is y 
and randomness r. Let 7 ( 0 , 6 ) G U be the referee’s output given that the message from Alice is a 
and the message from Bob is b. 

Consider the following SMP protocol for / with shared distribution p. 

1. Alice and Bob run the collision protocol in Claim 6.2 with domain size 2"* and parameter s 
specified above. Identify the domain [2"*] with the set of m-bit strings, and let A,B C {0,1}”* 
be the sets that Alice and Bob obtain as a result of running this collision protocol. Alice sends 
the referee each string r in A together with a(x,r), and Bob sends the referee each string r 
in B together with f3{y,r). 

2. The referee outputs an arbitrary answer in E A A f] B = 0. Otherwise, he chooses an 
element r G An B uniformly at random, and he outputs ^{a{x, r), f3{y, r)) (which he can do, 
because the values a{x,r) and I3{y,r) have been sent by Alice and Bob, respectively). 

The complexity of the above protocol is clearly in 0{colp{2'^){SMPp^rf + m)). It is clear that the 
protocol succeeds with probability (1 — s)(l — e) = (3 — 2e)/4 > 1/2. By amplification, we are 
done. □ 
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7 Characterising SMPp{f) via collision complexity: the lower bound 

In the previous section we proved the first part of Theorem 1.8. In this section, we complete what 
we started there by proving the second (and the harder) part of the theorem. The main ingredient 
of the proof is a lower bound on the communication complexity of GAPIPn functions (introduced 
in Definition 1.9) which is the content of Theorem 7.1. 

Theorem 7.1. For any bipartite distribution p, it holds that SMPp{GAPIPn) = D(colp(n)), where 
the constants in the asymptotic notations do not depend on p or n. 

The proof of the above Theorem is rather involved and requires several steps. We shall give an 
exposition of the main ideas behind the proof in the next subsection, and then carry out the proof 
in details in the remaining three subsections. Before delving into that, however, let us observe that 
this Theorem 7.1 together with two easy propositions imply the second half of Theorem 1.8. 

Proposition 7.2. For a bipartite distribution p, it holds that SMPp{GAPIPn) = 0(colp(n) logn). 


Proof. Note that GAPIPn has a straightforward SMP protocol with public randomness with 
cost O(logn) and error probability at most 1/3: Alice and Bob choose a common index i E [n] 
uniformly at random, Alice sends Xi to the referee, and Bob sends yi to the referee. The referee 
simply answers Xi ■ yt. Because this protocol uses logn bits of public randomness, Lemma 6.1 (i) 
implies the claim. □ 

Proposition 7.3. For a bipartite distribution p, it holds that SMPp{GAPIPn) = D(logn). 

Proof. Note that IPm can be reduced to GAPIPn^m by repeating the input vector n times, and in 
particular IPsiogn can be reduced to GAPIPn. By Corollary 3.7, this implies the claim. □ 


From the above we get the following corollary which was our main goal. 


Corollary 7.4 (Second half of Theorem 1.8). There exists a family of partial functions fjx from {0,1}'^ x 
{0,1}^ to {0,1} such that for any bipartite distribution p, it holds that 


SMPp(fjv) 


o(colp(i^) logiv) 

D (max I log N, colp (i^) }) 


(upper bound) 
(lower bound) 


where the constants in the asymptotic notations do not depend on p or N. 

The reason for appearance of terms in the above statement is that the input size to GAPIPn 
is N = nm = 8nlogn which essentially is the main source of the poly-logarithmic gap between 
the upper and lower bounds in the above. However, note that the upper and lower bounds in 
Corollary 7.4 are still quite close, because thanks to inequality (7) we have colp(N/logN) = 
fl,(colp(N)/ log N) —which means that the result is tight up to a factor of O(log^V) for every 
bipartite distribution p. 

7.1. The proof outline of Theorem 7.1 

The proof of Theorem 7.1 goes roughly as follows. Consider a hypothetical protocol for GAPIPn 
with complexity o(colp(n)). We will consider the behaviour of the protocol under the uniformly 
random input distribution—ignoring the promise on the input of GAPIPn for the moment. Because 
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of the low communication complexity of the protocol we can argue (using a “rounding argument” 
which extracts a collision protocol from a GAPIPn protocol by looking at the influential coor¬ 
dinates) that for a typical i G [n] the referee can guess the value of Xi ■ yi correctly only with 
probability at most 1/2 -|- 0(l/poly(n)) where the poly(n) here is a polynomial with super-linear 
growth. 

Next, we will use the repeat the above argument inductively n(n) times to conclude that there exists 
a set L C [n] of size at least 2re/3, such that the referee can distinguish the case when x* • y* = 0 for 
all i G L from the case when Xi ■ yt = 1 for all z G L only with probability 1/2 -|- o(l), where here 
we crucially rely on the fact the bias of the protocol in the previous step was only o(l/n). Finally, 
if we define the input distribution y to be uniform over {(x, y): Vi G L : x, = y*} U {(x, y): Vi G L : 
Xi ^ yi}, then each element in the support of y is a valid input to GAPIPn, but the protocol under 
consideration makes a mistake with respect to y with large probability which is in contradiction 
with the initial hypothesis. 

The above outline essentially contains a full description of our argument except for one subtle 
but important technical point: It turns out that as the hybrid argument progresses, turning the 
distribution of {xi,yi) from uniform to x* • y* = 0 (say) for many coordinates i, the players may 
in principle be able to use the extra shared randomness offered by conditioning on the previous 
rounds to their advantage. All of this essentially amounts to the fact, that in the midst of the 
hybrid argument the players instead of just having access to p, instead have access to the more 
powerful source p®aQ (or p®ai )—where denotes the uniform distribution on (x*,y*) G F™ x F™ 
conditioned on x* ■ y* = b. In fact, we do not know any way to show that the players cannot take 
advantage of the newly introduced shared randomness caused by conditioning, rather we use some 
ideas related to maximum correlation to show that, for our specific choice of parameter m = 8 log n, 
one has 

colp,g,t,o (n) = Q(colp(n)), colpig,ai(n) = Q(colp(n)), (10) 

which suffices for our application. 

We note that similar issues to the one above, i.e. the emergence of additional undesired correlations 
between parties as a result of conditioning, occur in many different contexts in the theoretical 
computer science literature, for example in the context of parallel repetition [10, 20] and also in 
many other places in communication complexity [3, 11]. However, the method we use here to handle 
this issue is quite different from the prior work, and hence, could potentially be of interest for those 
applications as well. 

7.2. Hardness of RIPn- the inductive step 

In our analysis, it is convenient to use a slight variant of the standard SMP model, which we call 
the pseudo-SMP model, with the following properties: 

• The referee can receive his own portion of input. 

• If shared randomness is available, the referee can see both Alice’s and Bob’s part of the shared 
distribution. 

The crucial condition here is the latter, i.e. the fact that Alice and Bob’s shared randomness (but 
not inputs) is visible to the referee. Note that in SMP, the players and the referee cooperate in order 
to solve the problem, giving the referee the above extra power only makes the model stronger; thus. 
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any lower-bound established for pseudo-SMP immediately implies a corresponding lower bound for 
the usual SMP model (possibly with some input the referee).® 

For our lower bound it is natural to introduce some auxiliary communication tasks. First consider 
the following general “guessing” information theoretic task, which is a two-player variant of a 
problem considered previously by Gavinsky, Kempe, Regev, and de Wolf [9]: 

Definition 7.5 (Random access problem). In RA^^s problem, Alice receives X S {0,1}”', Bob 
receives Y G {0,1}”, and the referee receives / G [n]. A pseudo-SMP protocol solves RAn,s 
problem successfully if the referee’s output Z G {0,1}^ U {_L} satisfies 

R^x,y^u[Z = {Xi,Yi) \I = i, Z^Y]> 1-6, Vi G [n]. 


Next we introduce the Random Access Inner Product Problem, which is slightly less information 
theoretic, and closer to GAPIPn- 

Definition 7.6 (Random access inner product problem). Let n,m G N and suppose Xi,yi G F™ 

def 

for i G [n] Denote x = (xi, ...,Xn) and y = (yi,..., yn)- Then RIPn,m{x, y, i) = Xi ■ yt. 


We will write RIPn to address RIPn,8iogn and let I = i denote the event that the third input to 
RIPn{-, ■, I) is fixed to i. 

The main purpose of this subsection is to prove the next lemma, which states that RIPn is a hard 
problem; this is essentially all we need in the next subsection to run our inductive argument for 
lower bounding SMPp{GAPIPn) there. 


Lemma 7.7. Let V be a pseudo-SMP protocol for RIPn using source p and communication 
cost CC-p such that CC-p -|- logn < 41 • colp(n). Then there exists i G [n] such that 


7. <2+0 


/ CCp -\- log n 
\ ncolp(n) 


+ o,i 

n 


( 11 ) 


where 'ji is the probability of success of V given I = i and uniformly random inputs x, y G (F™)”. 

To understand the statement of this lemma, consider the case where colp(n) = a;(logn). Then 
the lemma claims that if the cost CCp of a communication protocol V for RIPn is too small, 
i.e., CCp = o(colp(n)), then there is a coordinate i G [n] such that V answers Xi ■ yi correctly with 
probability at most 1/2 + o{l/n). 

To prove Lemma 7.7, we first show that RAn,s is a hard problem and then reduce RAn,s to RIPn, 
thus establishing that the latter is also hard. More precisely, we prove the following. 


Lemma 7.8. There exists a constant h > 0 such that the following holds. Let V be a pseudo-SMP 
protocol for RAn^s using source p and communication cost CCp such that 36(C'C'-p+log n) < colp(n). 
Then there exists i G [n] such that the probability of referee outputting anything beside T satisfies 


Pr[V{x,y,i) / T] 
x,y 


O 


CCp -I- log n \ 
ncolp(n) J 



where x, y G {0,1}” are chosen uniformly at random. 

®The reason we do this here is that to prove the lower bound in Lemma 7.7 (even for just basic SMP), we seem 
to need a lower bound for RAn.s against the slightly stronger pseudo-SMP model. 

^The implied constants in O(-) and o(-) here are universal and, in particular, do not depend on p or n. 
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Proof. The main idea is to use the protocol V for RAn^s to produce a good collision protocol (over 
domain of size n) by considering the influential coordinates of V. Following the above idea, it turns 
out the collision probability of the protocol we obtain will be directly related to probability that 
the referee did not output T in the original RAn,s protocol which implies the desired result. The 
details are as follows. 

Let {Ra,Rb) ~ denote the shared randomness between Alice and Bob. Let Ma and Mb be 
the random variables representing Alice and Bob’s messages to the referee. We also let Z denote 
the referee’s output. 

Define random variables La and Lb as a. function of Ra = ta, Ma = rua, Rb = fB and Mb = rri), 
by 

La = |i e [n]: H{Xi\RA = rA,MA = rua) < ^ 

Lb = |i e [n]: H{Yi\RB = vb^Mb = mB.) < ^ 

In other words, La (resp. Lb) consists of the coordinates of Alice’s inputs (resp. Bob’s inputs) 
where the referee, who sees both of rA,mA (resp. rB,mB) by definition of pseudo-SMP model® 
have substantial information about. 

Since we are assuming Ma and Mb are small, we expect that typically the sets La and Lb are also 
small. More formally, we show that for any va S 

Pt[\La\ >t\RA = rA]< ( 12 ) 

where the probability is over a random choice of x G {0,1}”. By the dehnition of La we have 

H{X\ \LA\>t,RA = rA) 

< H{Ma I |La| >t,RA = ta) + HiX \ Ma, \La\ >t,RA = ta) 

n 

= CCr + Y,H{X,\X<,,Ma, \La\ >t,RA = va) 

i=l 

n 

< CCr + Y,H{X,\Ma, \La\ >t,RA = va) 

i=l 

< CCr + n-^. 

Now we are ready to describe our collision protocol T = 2C + 4 log re By Lemma 3.6, this implies 
inequality (12). 

Because inequality (12) holds for every choice of ta, it holds without conditioning, i.e. 

Pr[|LA| >t]< 

Setting T = 2C + 4 log re and using the symmetric argument for Bob, we obtain that 

Pr[|LA| >rv|LB| >r] < 4- (13) 

re^ 

®Recall that in the pseudo-SMP model, the referee sees all the shared randomness as well as the messages from 
Alice and Bob. 
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Recall that our goal was to “extract” from V a collision protocol Vcoi- This is given as follows; In 
Vcoi Alice and Bob pick random inputs X,Y ^ respective half of the shared randomness 

{Ra, Rb) ~ and sample Ma and Mb by simulating the action of the players in V. The players 
can each locally compute as La, Lb as these are a deterministic functions of MaRa and MbRb 
respectively. The output of the protocol is L'^,L'^ C [n], where 


J La 


\La\ <T 
\La\ >T 


j, def 1 Lb 

\ a 


\Lb\<T 

\Lb\>T 


The analysis of collision protocol. Let a = minjg[„] Pr[i G La H Lb]- From (13) it follows that 
Mi G [n] : Pr[i G n L'g] > a - 

This means that 


colp { n,a - K ] <T. 




By inequality (7), this implies that 


colp(n) < T ■ 


— ln(l — 1/n) 


— ln(l — a + ^jrR') 


< T- 1 + 


2/n 


a —2 jn^ ) ’ 


where the second inequality follows from — ln(l — x) < 2x for 0 < x < 1/2 and — ln(l — y) > y 
for y < 1. If T < colp(n), then this implies that 


4 T 
n colp(n) 



(14) 


Next we need to related a to the probability that the referee outputs T in V. Combined with 
(14) this will finish the proof. Suppose 5 < ^. Since P is a correct protocol for RAn S and since 
h{x) = —xlogx — (1 — x)log(l — x) < 1/6 for 0 < X < (5, we have 


H{Xi I RaRbMaMb, I = i,Z ^ L) < I. 

6 

Since Xi and RbMb are independent conditioned on Ra and Ma, we immediately see that 

Y,rA,niB[H{Xi\RA = rA,MA = mA) \ I = i z ^ -.T] < 

D 

and so by Markov’s inequality we see that 


Pr[z G La \ I = i, Z ^ 1\> 2/3. 

Using the similar argument for Bob and the union bound, we obtain that Pr[i G La Lb \ I = 
i, Z 7 ^ T] > 1/3. Thus, for all i G [n] we have 


Pr[/ = i, Z 7 ^ T] < 3 • Pr[z G La H Lb, I = i, Z ^ L] 
< Pr[i G Lyi n Lb, / = i] 

= Pr[i G L^ n Lb] ■ Pr[/ = i], 


which after dividing by Pr[/ = i] = ^ gives us 

Pr[Z 7 ^ T I I = i] < 3Pr[i G La H Lb] = 3a. 


(15) 
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Combining (15) and (14) we have 


Pr[Z / _L I / = i] < 3a 


O 


CCp + logn ^ 1 \ 
n • colp(n) J ’ 


which finishes the proof. 


□ 


It remains to see that RA^^s can be reduced to RIPn- Here is where the fact that our lower 
bound for RA^^s in Lemma 7.8 is against the stronger pseudo-SMP model comes handy as our 
reduction from RA^^s to RIPn only produces a pseudo-SMP RAn ,5 protocol—even if the original 
RIPn protocol was a standard SMP protocol. Also, here is one place in the argument where the fact 
that the source p is non-degenerate becomes important because by the definition of pseudo-SMP 
and the fact that p is non-degenerate, we can assume Alice and the referee (and symmetrically also 
Bob and the referee) share as much shared randomness as desired. 

Lemma 7.9. Let V he a pseudo-SMP protocol for RIPn using shared distribution p, and define 7 ^ 
be the probability, over the uniform input distribution over {x,y) € (F™)” x (F™)"', that V gives 
the correct answer conditioned on I = i. Then, there exists a pseudo-SMP protocol V' with the 
same communication cost and solves RAn,s with 5 = o(l/re) such that 

Pt[V'{X, y, I) / L] > 27 , - 1 - o . (16) 


In the proof we denote Alice’s and Bob’s messages to the referee in Protocol V by Ma = MA{ai, ■ ■ ■, Un, Ra) 

Mb = Msibi,... ,bn, Rb)- The referee’s answer will be denoted by Z = Z{i, Ra, Rb, Ma, Mb) S 
{0,1}2U{±}. 

Proof. Let us describe a protocol V' for RAn,5 (derived from protocol V for RIPn). 

1. Alice and Bob receive x,y € {0,1}"' and their part of shared randomness, Ra, Rb- The referee 
sees both Ra and Rb. 

2. Alice and the referee use the randomness they share (independent of Ra) to choose uniformly 
at random 2 n values 0^7 € F™, for (j, t) £ [n] £ { 0 , 1 }. Similarly, Bob and the referee use 
their shared randomness to choose 6^7 £ Vff for (j, t) £ [n] x {0,1} at random. 

3 . Alice and Bob send the referee messages Ma = MA{ai^xi, ■ ■ ■, <in,xn,RA) and Mb = MB{bi^y-^ , bn,yn,RB) 
where Ma and Mb are as in V. 

4 . For each choice of si,..., £ F™, the referee computes 

.,s„ = Z{Ra, Rb, Ma, Mb{si, ... ,Sn, Rb)) S {0,1}, 
and also (Isi,...,sn defined similarly. 

5 . The referee computes za,zb S { 0 , 1 , T} defined as follows. If and • Si agree for 

more than 2"*^(l/2 -|- 1/2™-/^) choices of (si,...,s„) £ (F^)"', then let 27 = 0; otherwise, 
if asi,...,s„ and 0*7 • Si agree for more than 2™'"'(l/2 -|- 1/2™/^) choices of (si,..., Sn) £ (F^)", 
then let 27 = 1 ; if neither holds, let 2:7 = _L. Define zb analogously. If either 27 or zb are _L 
the referee outputs Z = T and otherwise he outputs Z = (27, zb). 
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First we analyse the probability that V' outputs ± to confirm (16). Fix i, Ra, Rb,xi, ..., Xn, yi, ■ ■ ■ ,yn 
and consider aj^t and bj^t as the random variables. Note that if V' outputs _L, then one of the fol¬ 
lowing events has to occur: 


eA : #{(si,..., s„) e (F™)” : 

es : #{(si, ■ ■ ■, Sn) G (F2")"': f3si,...,s, 


= ai^.-Si}<2^^ ( - + ^ 


1 1 


= k,y. • < 2-" - + 


2 / ' 


Let e = ei Vei. Let error denote the event that the referee from V given Ra^ Rb, Ma, Mb and i 
would give a different answer from Ui^xi ' k,yi- Then by definition Pr[error | I = i] = 1 — 7 *. On 
the other hand, Pr[error |eA/ = i] >1/2 — 2 and therefore 


Pr[e \ I = i] 


Pr[error A e | / = z] ^ Pr[error \ I = i] ^ 
Pr[error | e A / = z] “ Pr[error | e A / = z] “ 


1 - 7 » 

1/2 - 2-"*/3 


< 2 - + 


where the last inequality follows from our assumption that m > 8. Therefore, the probability that 
V' returns an answer different from “T” when / = z is at least 27 ^ — 1 — 2^“™'/3 = 27 ^ — 1 — o(l). 
So it just remains to show that condition on being different from T, the output of V is {xi,yi) at 
least 1 — 0 ( 1 ). To do so we employ a simple Fourier analytic (orthogonality) argument. 

Let h:¥^ ^ [-1,1] be 


h{y) = E 


a 


Sl,...,Si-l,y,Si+l,...,SnJ I 


and define the random variable BAD by 


BAD = |z G F^ : Eh{y) • (-1)^'^ > 2^-™/3 

Moreover, note that although BAD depends on Ra, Rb, MA,ai^xi:Ci 2 ,x 2 ^ ■ ■ ■ it is crucially in¬ 

dependent of the choice of aiA-xi,a 2 ,i-x 2 i ■ ■ ■ ■i0.n,i-xn- On the other hand, the probability that 
2 ;a = o^iA-xi, which corresponds to the referee’s error in identifying Alice’s input, is upper bounded 
by Fr[aiA-xi G BAD] which is equal to because ai^-xi was chosen uniformly at random inde¬ 
pendently of BAD. 

Finally, by basic Fourier analysis (i.e. the orthogonality of the characters y e-)• (- 1 )^'^) it follows 
that 

< 21-^3 = 

This means that condition on za 7 ^ T, the referee with probability 1 — o(l) predicts the correct 
Qi^xi, and hence the correct Xi G {0,1}, as Alice’s input. Repeating the same argument for Bob the 
result follows. 

□ 



Proof of Lemma 7.7. This follows immediately from Lemma 7.8 and 7.9. 


□ 
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7.3. Hardness of gapped inner product 

The hardness of RIPn can be used as an inductive step to prove the desired lower bound for GAPIPn 
via a hybrid argument. Before carrying this out, let us establish some useful notations. 

For a bit 6 € {0,1} and a positive integer m (as usual here m = 81ogn), we define am,b to be the 
uniform distribution over {(u, v) G F™ x F™: u ■ v = b}. 

A slight (and ultimately insignificant) issue that arises in our proof is the asymmetry of 6 = 0 and 
6 = 1 cases, i.e. the fact that it is slightly more probable for the inner product of two random 
x,y £ F™ to be 0 than 1.® To make the notation related to this issue simpler, we let be 
the probability of two randomly chosen x,y £ F™ satisfy x ■ y = b] that is, 7 o = 5 + 2 m+;i and 
7i ^ 2^rr+T • 

Lemma 7.10. Let m and < 7^,6 be as in the above paragraph. Then we have 

SMPp{GAPIPn) = ll(mincolp0^^(,(n)), 

h 

where the constant factor in the asymptotic notation does not depend on n or p. 

As we noted before, the reason colp^o-,^ ^(n) as opposed to colp(n) appear in the above is that 
starting from the second step of our induction we will fix the inner product of some of the Xi ■ yi 
for some i £ Lt which the players could presumably take advantage of as an extra source of 
shared randomness. In the next section, we show that with our choice of parameter m = 81ogn 
min{colp,g,o-^Q(n),colp(g,o-^p(n)} = n(colp(n)), and hence the players are in fact unable to take 
advantage of the extra shared randomness caused by conditioning; combined with Lemma 7.10 
that will hnish the proof. 

Remark 7.11. We shall note that even in the midst of the hybrid argument, there is still a minor 
difference between the original source p and the auxiliary new source ( 7 ^, 6 , in that unlike in the 
case of yO, the players can only sample Um,b for limited number of times, i.e. t < However, we 

are not able to, or rather do not need, to exploit this fine difference in the proof. 

Proof. Note that if min^ colp(g,o- 6 (? 7 ) < Clogn (for suitably large constant C to be determined later) 
then the result immediately follows from Proposition 7.3. So in what follows we focus on the other 
case. 

We denote the shared randomness between Alice and Bob by {Ra,Rb) ~ and their in¬ 
put by {x,y) £ (F™)"" x (F™)”. We will denote Alice’s and Bob’s messages to the referee by 
f{RA,x),g{RB,y) £ {0,1respectively. For x,y £ (F^)"', u £ U^, v £ V^, ands,f G {0,1}*^*"^, 
let TTx^y{u,v, s,t) be the conditional probability that the values of the shared random variables 
are {u,v) and the messages from Alice and Bob are s and t, respectively, given the input {x,y). 
Note that this probability is well-defined even if the input (x, y) does not satisfy the promise 
of GAPIPn- Then TTx,y is a probability distribution on x x { 0 , x { 0 ,. 

For L C [n] and 6 G {0,1}, let 

Sb,L = {{x, y) £ (F^)- X (F^)": X, • y, = 6 (Vi G L)}. 

®Note that it does not seem possible to immediately dispose of this asymmetry because Lemma 7.7, the basis of 
our induction, relies on the uniform distribution on inputs which has this asymmetry inherent with it. It is quite 
conceivable that we could have established a variant of Lemma 7.7 with a modified distribution eliminating the 
asymmetry, but it is likely that this would have just amounted to a reshuffling of the argument. 


21 





Let T = [2n/3]. Then Lemma 7.7 implies that if 6 G {0,1}, L C [n], and \L\ < T, then there 
exists i G [n] \ L such that 

11(1 — lb)'E(x,y)£Sb,L\Sb,Lu{i}^'^^’y^ “ ^ 

where here we took the advantage of the fact that n — |L| = fl(n) to absorb the resulting constant, 
due to the shrinkage n n \ L, in the asymptotic notation. Noting that Ej-^, ^ = (1 — 

'yb)^{x,y)eSb,L\Sb ,LU{i} + 7b^{x,y)&Sb ,Lu{i} ’ that 


cc^yn_Y 

ncolp^abin) J \nj 


l|EKj;)G5,,as,.^uo}Ks/] ^ ( n00^6(«) ) ^ 

Applying induction and the triangle inequality we see that there exist size T sets Lq,Li C [n] such 
that 


II (270) E(a;,j/)G 5 o,LQ ['^x,y] 


(271) E(a;,j/)G 5 i,_Lj [^ 3 ;,j/] 111 O 


f CCy + iogn \ 

Vminfe colp0o-j,(n)y 


+ 0 ( 1 ). 


Noting that with our choice of parameters ( 270 )^ and ( 271 )^ are both 1 + o(l) the above implies 


|E(a;,j/)g 5 o,z,o ['’^x,y\ 


E(x,y)eSi,i^ 


VT: 


x,y\ 


r CCv + \ogn \ 

Vminfe colp,g,^(, (n) J 


+ 0 ( 1 ). 


On the other hand, every input (x, y) G Sh,Lb is a valid input in support of GAPIPn, and therefore 
the correctness of the protocol implies that ||E(x,y)GSo lq ~ E(x,p)gSii,j Ka:, 2 /]lli — f- Plugging 
this in the above and using the assumption that min;, colp^o-^ (n) is larger than sufficiently large 
constant multiple of logn the result follows—which is something we can assume without loss of 
generality as seen from the discussion in the beginning of the proof. □ 


7.4. Removal of additional shared randomness caused by conditioning 

In this Section we prove the next Lemma which Lemma 7.10 immediately imply Theorem 7.1. 

Lemma 7.12. Let am,b be the uniform distribution over {{x,y) G {0,1}”* x {0,1}”*: x ■ y = b} 
and let a be the either of am,o or am,i- If n < 2 ”*/^“^ then for any bipartite distribution p we 
have colp(g)o-(re) = n(colp(n)), where the constant factor is independent of m,n or p. 

We need two somewhat more technical statements to prove this Lemma. The first one is a basic 
estimate on the correlation complexity a which follows from elementary Fourier analysis. 

Claim 7.13. Let m > 2 and h G {0,1}. Let a be the uniform distribution over {(u,v) G {0,1}”* x 
{0,1}”* : u ■ V = b}. Then it holds that Cor(cr) < 1/2”*/^“^. 


Proof. For / :{0,1}”*—)-M, the Fourier transform of / is defined by 

nG{ 0 ,l}"* 


Parseval’s identity states that 

^ J] f{uf= 
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Let fj, be the marginal distribution of cr on either part. Let f,g: {0,1}™ —)• M, and assume 
that = Bvr^fi[g{v)] = 0 and that Bur-.fiifiu)'^] = Bvr-.fiigiv)'^] = 1. Because g{u) > 

lj{2'^ + 1 ) for all u € { 0 , 1 }™, it holds that 

E /(«)'<(2™ + l)E.^^[/(n)2]=2™ + l, 


and similarly < 2 ™ + 1 . 

Simple calculation shows that 


B(u,v)^Af(.u)g{v)] 

1 


22m-l . 2 ^ 


T 9{v) Y1 - 


(l)fe . 2 ™ + 1 


Bur^Af (^)] 


DSfO,!}" 


\u:u-v=b 


22 m (- 1 )^ • 2 ™ 


E ( 2 ”/'(’') -/(0))9(’') 


«e{o,i}"* 


X] Av)9{ 


V - 


f(0) 


2 ™+ (-!)'> ^ J 2"^^ + i-lf ■ 2^' 




2 ™ + (- 


KAAv) + 


i;e{o,i} 


(-l)V( 0 )g( 0 ) 
22 m (-l)f' . 2” 


By the Cauchy-Schwarz inequality, the summation in the first term is bounded as 


X] fiAaiv) < 




and therefore we have that 


«e{o,i}" 


1 


/ 9{vy 

DGfO,!}"^ 


Y / E aivY' 


< 


2 m /2 

2 ™ + 1 
2 m /2 ’ 


JiSfO,!}" 




E 


[u,v)^a[f{u)9{v)] < 


2 ™ + 1 


(-i)V(0)g(0) 

2 ™ + (-l)^ ' 22 ™ +(- 1 )^- 2 ” 


+ 


(18) 


If 6 = 0, then/(O)^ < Bu^AfiA '^]/= (2™+ l)/2, and similarly 5 ( 0 )^ < (2™ + l)/2. Therefore, 
(18) implies that 

11 1 

B[u,v)^Af iA9{A] ^ 2m/2 2™+r ^ 2™/2-1 ■ 

If 6 = 1, then the vector 0 G {0,1}™ is not in the support of g, and therefore we may assume 
that /(O) = 0 without loss of generality. Then (18) implies that 

1 2 ™ +1 1 

B(u,v)r~.a[f {A9 {v)] < • 2 m _ I ^ 2 ™/ 2-1 ' 

□ 
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Claim 7.14. Let /? be a bipartite distribution on [/ x y, and cr be a bipartite distribution on X xY. 
Then for p with Cor(cj) < p < 1, it holds that agiCp^^{p) > agrp(p — Cor((T)). 

Proof. Let c > agiCp^^{p). Let {£, /, g) be an agreement protocol for piSicr with success probability p 
and cost at most c. By Lemmas 3.4, it holds that Cor((T®^) = Cor(cj). 

For u e let F{u) = [/(«,x)]. Similarly, for v e let G{v) = 

Fix u G and v Because / and g take values in [0,1], their variances for fixed u and v are 

at most 1. By Lemma 3.3, we have that 

F^^:,^y)^a®i[f{u,x)g{v,y)\ < F{u)G{v) + Cor(fj). 

Then we have that 


P = F‘i^u,v,x,y)r^p®f^®a®t[f{'^^x)g{v,y)] < E(u,v)r..^p^4F{'^)G{v)] + Cor((T). 

This means that {i, F, G) is an agreement protocol for p with cost at most c and success probability 
at least p — Cor((T), and therefore agrp(p — Cor(a)} < c. Because this holds for any c > agip^^(p}, 
we conclude that agrp(p — Cor ( it )) < agrp^^(p). □ 

Proof of Lemma 7.12. Because n < 2'”/^“^, Lemma 7.13 implies that Cor((T) < l/(2n). Then by 
Lemma 7.14, it holds that 


The lemma follows from Lemma 5.5. □ 
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A Relation to hypercontractivity 


As we discussed, collision and agreement complexities are instances of measures of quality of cor¬ 
relation, and as such one natural way to analyse collision complexity is to relate it to other more 
widely used measures of quality of correlation. Indeed, in the case of maximum correlation, this 
was done in Subsection 7.4 and that connection, i.e. 

agrpip) > Vp- Cor(p), (19) 

played an important role in the final stage of our argument. 

In this Section, our goal is to investigate the connection between collision and agreement complex¬ 
ities and another widely used measure of quality of correlation, i.e. hypercontractivity. As we shall 
show (and not surprisingly) the information offered by hypercontractivity properties of a source p, 
when available, is usually more powerful that what can be understood from the simple information 
provided by the maximum correlation. More precisely, although an upper bound on the maximum 
correlation of p can be useful for lower bounding the collision complexity when the size of p is not 
too small compared to the domain size ^ (the agreement parameter), the estimates such as (19) 
stop being useful for very small p, which is necessary for an asymptotic lower bound on the collision 
complexity colp(n) as n —)• oo. 

The organisation of this appendix is as follows. We first start by discussing some of the elementary 
(and well-known) properties of hypercontractivity in the first subsection; the material here is stan¬ 
dard but we included them for the convenience of the reader. Next, we establish the connection 
between the collision complexity and hypercontractivity which is the central result of the appendix. 
Finally in the last subsection we give a quick application of the preceding result by proving some 
simple (but non-trivial) upper and lower bounds for the collision complexity of pdisj- 

A.l. Preliminaries on hypercontractivity 

A generalisation of Holder’s inequality provides a way to quantify the independence of a bipartite 
distribution. 

For p > 1, a probability distribution p, on X, and a function f: X ^ C, the Lp-norm of / with 
respect to p, denoted by ||/||ip(p), is 

For p = oo, the Loo-norm of / with respect to p is dehned by 

If 1 < p,p' < oo and 1/p -|- 1/p' = 1, then Holder’s inequality states that for any probability 
distribution p on U x V and any functions /: 1/ —)• C and p: H —>■ C, it holds that 

\E{u,v)^p[f{u)9iv)]\ < \\f\\Lp{pu)\\9\\u^{pv)^ 

where pu and pv are the marginal distributions oi p on U and V, respectively. If p is a product 
distribution, then it holds that 

lE(u,v)'^pU^Py [fiu)g{v)]\ < ||/|Il,(p^)||5|Il,(p^)- 

We consider a property between these two cases. 
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Definition A.l (Generalized {p,q) Holder inequality). Let p,q >1. A probability distribution p 
on ?7 X H is said to satisfy the generalised {p, q) Holder inequality if for any functions /: C/ —)■ C 
and g: V —>■ C, it holds that 


\E(u,v)^p[f{u)g{v)] \ < ll/llL^(py)lbllL,(p^), 
where pu and pv are the marginal distributions of p on 17 and V, respectively. 


It is often more convenient to prove this via hypercontractivity of a linear operator. Note that a 
bipartite probability distribution p onU xV defines a stochastic channel from V to U and therefore 
also defines a linear operator from to C^: 


{T,f)(v) 


E 

U£U 


p{u,v) 

pv{v) 


fiu). 


For 1 < p < q, this operator Tp is said to be q-to-p hypercontractive with respect to distribution pjj 
if for any /: U —)• C, it holds that 


W'^pfWlqipv) - \\f\\Lp{pu)- 

In this case, we say that the bipartite distribution p itself is q-to-p hypercontractive. Note that 
the order of sets U and V matters in this terminology. The key relation between the generalised 
Holder inequality and the hypercontractivity which we will use is the following: 

Lemma A.2. Let p,q,q' S [l,oo] satisfy p < q and l/q + 1/q' = 1. Let p be a probability 
distribution on U x V. If p is q-to-p hypercontractive, then p satisfies the {p, q') generalised Holder 
inequality. 

Proof. For /: 17 —^ C and g: V ^ C, it holds that 

\'E{u,v)^p[fiu)g{v)]\ = \Evr^pv[{Tpf){v)g{v)]\ 

^ \\'^pf\\Lg{pv)\\9\\L^,{pv) 

— IIS'llLqdPv)’ 

where the first inequality follows from Holder’s inequality and the second inequality follows from 
the q-to-p hypercontractivity of p. □ 

Like the maximum correlation, hypercontractivity tensorizes. 

Lemma A.3. Let 1 < p < q < oo. For i G [n], let pi be a probability distribution on Ui x V). If pi 
is q-to-p hypercontractive for all i G [n], then pi 0 ■ ■ ■ 0 pn is also q-to-p hypercontractive. 

A proof of Lemma A.3 is standard and follows from the fact that (g'/p)-norm on vectors satisfies the 
triangle inequality (Minkowski’s inequality). The proof is essentially the same as that of Lemma 5.3 
of [12] and of Proposition 3.11 of [16]. 
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A.2. Hypercontractivity implies high collision complexity 

The following lemma states that hypercontractivity of p implies an asymptotic lower bound on the 
collision complexity. 


Lemma A.4. Let p he a prohahility distribution on U x V. Let 1 < p < q, and let q' and c 
he 1/q + 1/q' = 1 and 1/c = 1/p + 1/q'. If p is q-to-p-hypercontractive, then for any z G [0,1], it 


holds that 




{p^lPq'LLzY 


and in particular, 

colp(n) = 


As a sanity check note that in the above since p > 1 and q < oo we have ^ | ^ < 2. Hence, 

in the Lemma c > | which is in agreement with what we expect from the first part of Fact 1.7. 


Proof. Let {£, /, g) be an agreement protocol for p with success probability z, and let 

a = E[fiu)] = 
b = B[g{v)] = \\g\\L,(py)- 

We will prove that 

a + b> -{p^/Pq'^^'^'zy. 
c 

Lemma A.3 implies that p^^ is q-to-p hypercontractive, and therefore Lemma A.2 implies that 
satishes the {p, q') generalised Holder inequality: 


2 = E(^u,v)r^p<S>4fiu)g{v)] < \\f\\Lj,{pf)\\s\\L^Ppf)- 
Because \\f\\Lao{pu) — holds that 

and therefore Similarly, it holds that > \\g\\j^ and therefore > 

z. 


By concavity of logarithm, it holds that 

log(c(o + 6)) = log 


(l/p) ■ pa+ {1/q') ■ q'b 


> 


1/p + 1/q' 

{1/p) log (pa) + {1/q') log{q'b) 


1/p + 1/q' 

= c ■ ((1/p) log(pa) + {1/q') \og{q'b)) 


By taking the exponentials of both sides, we obtain that 

c{a + b) > {p^lPq'^/i'a^/Pb^/i')'^ > {p^/^q'^/'^'zf. 


This means that the agreement complexity of p satisfies that agr^(z) > {p^/^q'^/^'z)''^/c. By 
Lemma 5.5, we have that colp(n) = fl{n agip{l/n)) = n(n^“'^). □ 
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A.3. Analysing the collision complexity of pdisj 

In this section, we analyse the collision complexity of Pdisj show that it is strictly in between the 
extremes of a constant and ^/n. The lower bound gives us an opportunity to show the applicability 
of the techniques developed in the previous parts of the Appendix in this simple setting. 

Proposition A. 5. Let p = pdisj be as in Example 1.3. We have, 


A.3.1 Upper bound 

Let 0 < p < 1, and consider the following agreement protocol {i,f,g). Let £ = [logg(l/p)J. Define 

.. ^ jl, xi = --- = xe = I, , . ^l/2^ y\ = --- = yi = b, 

L[xx,...,x() = \ . , 5(yi,---,y£) = < 

I U, otherwise. ID, otherwise. 

We claim that this agreement protocol has success probability at least p and cost less than ^. 
Note that by definition of distribution p, we have that 

Pr[3:i = • • • = = 1 A yi = • • • = i/£ = 0] = ^. 

Therefore, it holds that the success probability of the protocol {I, /, g) is 

E[/(xi,.. .,X()g{yx,.. .,ye)] = ^ Pr[3:i = ■ ■ ■ = xe = 1 A yi = ■ ■ ■ = ye = 0] = ^ ■ ^ > p. 


The cost of this protocol is equal to 

+ 


1 /2\^ 2 


6 


< 


6 


2^ 13/ 3^ 3^+1 3iog6(i/p) 




The upper bound colp^^^^.(n) = follows from Lemma 5.5. 


A.3.2 Lower bound 


We claim that p = pdisj is 3-to-3/2 hypercontractive. Let /: {0,1} —)• C, and we will prove 
that llTp/llg < 11 / 113 / 2 - ® = |/(0)| and /3 = |/(1)|. Because 


(Pp/)(0) = 


/(0) + /(l) 


(r,/)(i) = /(o), 


we have that 


\Tpfh = 


/(0) + /(l) 


1/3 


3/2 




+ 3i/(o)r 

\ 2/3 


< 


2 {a + ^ 


1/3 


+ 3“ 
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Simple calculations show that 


ii/iii/2 - nnl = > 0, 

establishing the claim that p = p^isj is 3-to-3/2 hypercontractive. The lower bound colp^.^^. (n) 
follows from Lemma A.4. 
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